FAS - A Freshness-Sensitive Coordination Middleware for a Cluster of OLAP Components
Authors
Abstract
Data warehouses offer a compromise between data freshness and query evaluation times. However, a fixed preference ratio between these two variables is too undifferentiated. With our approach, clients submit a query together with an explicit freshness limit as a new Quality-of-Service parameter. Our architecture is a cluster of databases. The contribution of this article is the design, implementation, and evaluation of a coordination middleware. It schedules and routes updates and queries to cluster nodes, aiming at a high throughput of OLAP queries. The core of the middleware is a new protocol called FAS (Freshness-Aware Scheduling) with the following qualitative characteristics: (1) the requested freshness limit of queries is always met, and (2) data accessed within a transaction is consistent, independent of its freshness. Our evaluation shows that FAS has the following properties: OLAP query-evaluation times are close (within 10%) to those of an idealized setup with no updates. FAS makes it possible to trade 'up-to-dateness' for query performance effectively. Even when all queries request fresh data, FAS clearly outperforms synchronous replication. Finally, mean response times are independent of the cluster size (up to 128 nodes).
* Current affiliation: Otto-von-Guericke-Universität Magdeburg, Germany
Proceedings of the 28th VLDB Conference, Hong Kong, China, 2002
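The routing idea in the abstract can be illustrated with a minimal sketch. It is not the FAS protocol itself, whose details the abstract does not give. The `Node` class, the measurement of freshness as a lag in seconds, and the least-loaded selection policy are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical cluster node; all fields are illustrative assumptions."""
    name: str
    staleness: float          # seconds the node's data lags behind the latest update
    active_queries: int = 0   # queries currently running on this node


def route(nodes, max_staleness):
    """Pick a node for a query that tolerates at most `max_staleness` seconds of lag.

    Assumed policy: among nodes that are fresh enough, take the least loaded.
    If no node qualifies, refresh the least stale node before using it,
    so that the requested freshness limit is always met.
    """
    fresh_enough = [n for n in nodes if n.staleness <= max_staleness]
    if fresh_enough:
        chosen = min(fresh_enough, key=lambda n: n.active_queries)
    else:
        chosen = min(nodes, key=lambda n: n.staleness)
        chosen.staleness = 0.0  # stand-in for applying the pending updates
    chosen.active_queries += 1
    return chosen


cluster = [
    Node("n1", staleness=0.0, active_queries=3),
    Node("n2", staleness=30.0),
    Node("n3", staleness=120.0),
]

# n1 and n2 both satisfy a 60-second limit; n2 carries less load.
relaxed = route(cluster, max_staleness=60.0)

# Neither n2 nor n3 satisfies a 5-second limit, so the less stale one is refreshed.
strict = route(cluster[1:], max_staleness=5.0)
```

In this sketch, a query with a relaxed limit can run on a lightly loaded but slightly stale node, which is how trading 'up-to-dateness' for query performance would show up in practice. A query with a strict limit forces a refresh first.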
Similar resources
OLAP with a Database Cluster
This chapter presents a new approach to online decision support systems that is scalable, fast, and capable of analysing up-to-date data. It is based on a database cluster: a cluster of commercial off-the-shelf computers as hardware infrastructure and off-the-shelf database management systems as transactional storage managers. We focus on central architectural issues and on the performance impl...
Access control in ultra-large-scale systems using a data-centric middleware
The primary characteristic of an Ultra-Large-Scale (ULS) system is its ultra-large size in any relevant dimension. A ULS system is generally regarded as a system-of-systems with heterogeneous nodes and autonomous domains. As a system-of-systems grows and the demand for interoperability between its subsystems increases, achieving a more scalable and dynamic access control system becomes an im...
Fraîcheur et validité de données répliquées dans des environnements transactionnels
We propose a framework for managing the quality of data replicated optimistically on a database cluster. It is based on a model of data quality. Qualitatively, we distinguish between “freshness” and “validity” of data. Quantitatively, data quality is expressed through divergence measures between the data read and the same data with perfect quality. Users specify a minimum level of qu...
OLAP Query Processing in Grids
(Work partially funded by CAPES-COFECUB (DAAD project), CNPq-INRIA (Gridata project), French ANR Massive Data (Respire project) and the European STREP Grid4All project.)
OLAP query processing is critical for enterprise grids. Capitalizing on our experience with the ParGRES database cluster, we propose a middleware solution, GParGRES, which exploits database replication and inter- and intra-query parallelism to efficiently support OLAP queries in a grid. GParGRES has been partially implemented as database grid services on Grid5000. We give preliminary experimental...
Replica Refresh Strategies in a Database Cluster
Relaxing replica freshness has been exploited in database clusters to optimize load balancing. However, in most approaches, refreshment is coupled with other functions such as routing or scheduling, which makes it hard to analyze the impact of the refresh strategy itself on performance. In this paper, we propose to support routing-independent refresh strategies in a database cluster wi...
Publication date: 2002